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Abstract 

A variant of Kauffman's model of cellular metabolism is presented. It is a randomly 
generated network of boolean gates, identical to Kauffman's except for a small bias in favor 
of boolean gates that depend on at most one input. The bias is asymptotic to as the 
number of gates increases. Upper bounds on the time until the network reaches a state cycle 
and the size of the state cycle, as functions of the number of gates n, are derived. If the bias 
approaches slowly enough, the state cycles will be smaller than for some c < 1. This 
lends support to Kauffman's claim that in his version of random network the average size of 
the state cycles is approximately n^^'^. 

Proposed running head: Antichaos in Random Boolean Nets 

Keywords: ceUular automata, random graphs, stabihty. 

1 Introduction 



Let n be a natural number. A boolean cellular automaton with n gates consists of a directed 
graph D with vertices 1, . . . , n (referred to as gates) and a sequence b = . . . , 6„) of boolean 
functions. The number of arguments of each function 6j is the same as the indegree of gate i. 
We say that gate j is an input to gate i if (j, i) is an edge of D. A boolean cellular automaton 
B = {D, b) defines a map from {0, 1}" (the set of 0-1 sequences of length n) to {0, 1}" in 
the following way. For each i = 1, . . . ,n let jn, . . . ,jiki be the inputs of i in increasing order. 
Given x = (xi,...,a;„) G {0,1}", B{x) = (6i(xj,,, . . . , . . . , 6„(xj„,, . . . , xj„,J). B may 

be regarded as a finite state automaton with state set {0, 1}" and initial state x. That is, its 
state at time is x, and if its state at time t is ?/ G {0, 1}" then its state at time t + 1 is B{y). 
We put B^{x) for the state of B at time t, and b\{x) for the value of its ith component, or 
gate, at time t. Since the number of states is finite, i.e. 2", there exist times to and ti such 
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that 5*" (a;) = B*^{x). Let ti be the first time at which this occurs. Then = 
for all t > to- We refer to the set of states : t > to} as the state cycle of x in {D, b), 

to distinguish it from a cycle of D in the graph-theoretic sense. The tail of x in {D, h) is 
: t < to}. 

S. Kauffman proposed boolean cellular automata as models of cellular metabolism. 
The gates represent genes within a cell, the state of a gate indicates whether the gene is 
active or inactive, and {D, b) describes how the activity of genes affects other genes. State 
cycles correspond to the possible behavior patterns that the cell can differentiate into. 

In an extensive series of articles (see for example [0, 0, |^), Kauffman described computer 
simulations on a particular kind of randomly constructed boolean cellular automaton. The 
number of gates n was fixed, typically in the range of several hundred to several thousand. 
Every gate had exactly two inputs, chosen independently with equal probability from the 



possibilities. Each gate was also assigned a random boolean function on its two inputs 



from the 16 equally likely possibilities. Then a random starting state was chosen, again with 
a uniform distribution on {0, 1}". The automaton was simulated, and the trajectory of its 
state at discrete time intervals was observed. 

A striking feature of the behavior of these random boolean cellular automata was their 
stability. Typically, the tail length and the size of the state cycle were quite small compared 
to n. Kauffman estimated the median size of the state cycle was on the order of n^^^. Also, 
the automata were very robust. Perturbing the state by flipping the value of one gate usually 
did not affect the state cycle that was entered. 

There are a number of interesting biological implications of these simulations. Perhaps 
the most fundamental is that there may be another major influence on the evolution of 
species besides natural selection. Small, stable state cycles are necessary to the proper 
functioning of any cell. Mutations are random modifications to the genes that control the 
cell. Kauffman's simulations suggest that most mutations preserve orderly behavior, and 
consequently may be passed on. If almost all mutations were selected against, evolution 
would be extremely slow. The evidence that many randomly constructed boolean cellular 
automata exhibit antichaotic behavior supports the idea of modern evolutionary theory that 
chance plays a greater role and natural selection a lesser role in evolution than previously 
thought. 

Although Kauffman's model has been studied empirically for many years, it was only 
recently that formal mathematical methods have been applied to it (see for example fl], 0). 
Luczak and Cohen appear to be the first to prove one of Kauffman's claims-that perturbation 
of a single gate almost never affects the state cycle that the automaton enters. They also 
derived a 2°^"^ upper bound on the size of the state cycle. 

In this article, we study a random boolean cellular automaton that is almost the same 
as Kauffman's (we shall make this precise shortly). Essentially, when boolean functions are 
assigned to the gates, we give a very small bias to those functions that depend on at most one 
argument. Depending on the strength of the bias, we will derive polynomial upper bounds 
on the size of the state cycle entered, and even sublinear upper bounds. We also obtain 
sublinear bounds of the tail length. 
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2 Preliminaries 



We will use the following notions, introduced by Kauffman [Q. 

Definition 2.1 Let f : {0,1}'^ — > {0,1} be a boolean function of k arguments. Then f is 
said to be canalyzing if there is some m = 1, . . . , k and some values m, f G {0, 1} such that for 
all X G {0, 1}'^, if Xm = u then f{x) = v. Argument m of f is said to be a forcing argument 
with forcing value u and forced value v. Likewise, if {D,b) is a boolean cellular automaton 
and bi is a canalyzing function with forcing argument m, forcing value u and forced value v, 
then input jim is a forcing input of gate i. That is, if the value of jim is u at time t, then 
the value of i is guaranteed to be v at time t + 1. 

Note that all four 1-input boolean functions are trivially canalyzing, and all of the 2-input 
boolean functions except equivalence and exclusive or are canalyzing. 

The next definition is borrowed partly from Kauffman |Q and Luczak and Cohen p. 

Definition 2.2 Again, {D, b) is a boolean cellular automaton. Using induction on t, we 
define what it means for gate i to be forced to a value f in t steps. 
// bi is the constant function v, then i is forced to v in steps. 

If all inputs jn, . . . of i are forced to Ui, . . . ,Uk respectively in t steps then i is forced 
to bi{ui, . . . , Uk) in t + 1 steps. 

If bi is a canalyzing function with forced input m, forcing value u, and forced value v, 
and jim is forced to u in t steps, then i is forced to v in t + 1 steps. 

By induction on t it can be seen that if i is forced in t steps, then it stabilizes for all initial 
states X in t steps. That is, for all t' > t, b\ (x) = b\{x). However, the converse is not true. 
It is easy to construct boolean cellular automata without any forced gates but with gates 
that stabilize for all initial states (see Figure 1). 



An example of a boolean cellular automaton without any forced gates, all of whose gates 
become stable. The labels in the circles denote the boolean functions assigned to the gates: 
© is exclusive or, and -> is negation. 

FIGURE 1 
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The next definition is due to Luczak and Cohen 0. 

Definition 2.3 For any gate i in {D,b), let 

Nq^i) = {i} and 
Na+ii-i) = ■ j is an input to i}. 

Then 

c<d 

Note that whether i is forced in d steps is completely determined by the restriction of D and 
b to 

The class of random boolean cellular automata studied in this paper is the same as 
Kauffman's except that the probabilities of assigning boolean functions to gates are slightly 
biased in favor of the six functions that depend on at most one argument: 

f{xi,X2) = Xi, 

f{Xi,X2) = ^Xi, 

/(xi,X2) = X2, 

f{Xi,X2) = -^X2, 

/(xi, X2) = 0, and 

f{Xi,X2) = 1. 



Let e{n) be a function on the natural numbers such that < e{n) for all n and lim„^oo ^{n) = 
0. (We will impose further conditions on e later.) A directed graph D with n gates is 
generated as in Kauffman's model, and the sequence b of boolean functions is generated 
using the distribution 

{1/16 — e{n)/16 if / depends on both arguments 
1/16 + £(n)/16 if / depends on one argument 
1/16 + 3e{n)/l6 if / is a constant function. 

The particular coefficients of e{n) in the above definition were chosen to simplify the notation 
in our proofs. Our theorems apply to any class of random boolean cellular automata where 
the probability of assigning / to gate i is 1/16 — af£{n) for some positive constant a/ when 
/ depends on both arguments, and the probability is 1/16 + afe{n) otherwise. The proofs 
for the general case are more cumbersome, but they do not involve any important additional 
ideas. We will use B = {D,b) to denote a random boolean cellular automaton generated as 
above. For any properties V and Q pertaining to boolean cellular automata, we put pr('P, n) 
for the probability that the random boolean cellular automaton on n gates has property V 
and pT(V\Q,n) for the conditional probability that V holds, given that Q holds. Usually, 
we will omit the n in these expressions since it will be understood. 

The properties of {D, b) that we are really interested in depend only on the function of 
{0, 1}" computed by {D,b). What this means is that if ji and j2 are the two inputs of a 
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gate i and 6j = /, where / depends on only one argument, say f{xi,X2) — Xi, then we can 
delete the edge {j2,'i) from D and replace / by the one argument function g{xi) = Xi. In 
this case we say that the input of i is ji and hi = g. If / is a constant function, we could 
delete both input edges, but for our purposes it is simpler just to delete the second input 
edge and regard / as a constant function of xi. This will make our proofs easier, and is the 
motivation for defining a slightly different class of random boolean cellular automata, which 
is essentially equivalent to the class just defined. 

We take e and n as before. For i = 1, . . . ,n independently, gate i will have two inputs 
with probability 1 — e{n) and it will have one input with probability e{n). If it has two 
inputs then the inputs and the boolean function assigned to it are chosen as in Kauffman's 
model. If it has one input then the input is chosen with uniform probability from among the 
n gates, and its boolean function is chosen from the four possibilities with equal probability. 
We will use B' = (D', h') to denote a random boolean cellular automaton generated this way, 
and pr' for its associated probability function. The following proposition shows that these 
two models of random boolean cellular automata have the same probability distribution with 
respect to the functions that they compute. 

Proposition 2.4 For any natural number n, let B = {D, h) and B' = {D', h') be the two 
random boolean cellular automata with n gates defined above. Then for any function F : 
{0, 1}" ^ {0, lY, pr{B = F) = pr'(S' = F). 

Proof. We need only show the following. 

1. For every gate i, every 1 < ji < j2 < n, and every boolean function / that depends on 
both arguments, 

pr' (inputs of i are ji, j2 and b[ — f) — pr (inputs of i are ji, j2 and bi — f). 

2. For every gate i, every 1 < j < n, and every boolean function g of one argument, 

pr'(input of i is j and b[ — g) = pr (input of i is j and bi — g). 

3. For every gate i and constant boolean function g, 

pr'(&i = g)= w{hi = g). 

To prove 1., 

pr' (inputs of i are ji, j2 and b[ — f) 



(1 - e{n)) X X ^ 

n \ Id 



X 



2 

1 e{n) 



n \ V16 16 
pr (inputs of i are ji, j2 and bi — f). 
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To prove 2., 



n — 1 1 11 
pr'(input of i is j and h^ = g) = (1 — e{n)) x ^ ^ Ta ~^ ^(^) ^ ~ ^ 7 



16 n 



2 X a + ^'«' 



To prove 3., 



n \16 16 
pr(input of i is j and bi = g). 



1 1 

pr'(6^ = ^) = (1 X — +e(n) X - 

1 3e(n) 
= T7: + 



16 16 

pr(6i = g). 



□ 



Thus we make no distinction between {D,b) and {D',b'). 

The following lemma is essentially a generalization of Lemma 1 in |^ . 

Lemma 2.5 For sufficiently large n, any natural number d, and any gate i in the random 
boolean cellular automaton {D, b) with n gates, 



pr(i is not forced in d steps \ (i) induces an acyclic subgraph of D) < —. 

Proof. Let pd be the conditional probability in question. We first show that it satisfies the 
following recurrence. 

Po = 7/S-3e{n)/8, 
Pd+i = (l-£(n)/2)p,-(l-£H)p2/8. 

The base case for po follows from the fact that two out of the 16 2-input boolean functions 
are constant while two out of the four of the 1-input boolean functions are constant. Thus 
Po = {l-e{n)){7/8)+e{n){l/2). 

To prove the induction step, first take the case when i has two inputs. Then i is not 
forced in d + 1 steps if and only if neither of its inputs is forced in d steps and bi is not 
constant, or exactly one of its inputs is forced but its forced value is not forcing for i. The 
first possibility has probability x 7/8, while the second has probability 2pd{l —pd) x 1/2. 
If i has one input, then it is not forced if and only if the input is not forced and bi is not 
constant. This probability is x (1/2). Altogether we have 

Pd+i = il~sin))i7py8+Pdil-Pd))+sin)pd/2 
= (l-e(n)/2)p,-(l-£H)p2/8. 
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Let Qd = 1/pd- We will show by induction on d that qa > d/16 for sufficiently large n, 
from which the Lemma follows. Clearly Qo > 0. 

Assuming > d/16, we use our recurrence for Pd+i, getting 

1/qd+i = (1 - e{n)/2)/qd - (1 - e{n))/{Sql). 



Qd 



[l-s{n)/2)-{l-e{n))/{8qd) 
Qd 



Rearranging, 

Qd+i = 

- l-{l-e{n))/{8qd) 

> qd + {l-e{n))/8 

> + 1/16 for sufficiently large n since e{ri) — > 0. 

Thus qd+i > {d+ 1)/16, and the proof is complete. □ 
Our final basic idea concerns chains of gates that are not likely to stabilize. An unforced 
path is a sequence of distinct gates P = [ii,...,ip) such that i,. is an input of i^+i for 
1 < r < p and none of the gates are forced in 256/ e{n) steps. An unforced cycle is the same 
except ii = ip. 



3 Theorems and Proofs 

In this section, we will prove our upper bounds on the sizes of the tails and state cycles of 
the random boolean cellular automaton {D, h) as a function of e. First, we prove some upper 
bounds on the sizes of unforced structures in {D,b). Throughout the paper, log will mean 
loga- 

Lemma 3.1 If e{n) » 1/logn then 

pr((Z), h) has an unforced path longer than 41ogn/e(n)) = o(l). 

Proof. Let / = [4 logn/£(n)] . The gates in a path of length / can be chosen in n{n — 
1) . . . (n — / + 1) < r?/ ways. Having chosen the / gates ii, . . . ,ii, for r = 1, . . . , / let Vr be 
the event that ii, . . . ,ir form an unforced path. Then the probability that ii, . . . ,ii actually 
form an unforced path is bounded above by 

I 

Y[{{1 - e{n))ar + e{n)Pr), 

r=2 

where 

ar = pr(v_i is an input to v and v is not forced in 256/e{n) steps | 

indegree(v) = 2 and Vr-i), 
Pr — pr(V-i is the input to v and v is not forced in 256/ e{n) steps | 

in degree (v) = 1 and Vr-i)- 



7 



Clearly = l/(2n). 

Assuming v has two inputs and Pr-i holds, let jV 7^ V-i be the other input of ir- Let 
be the event that ^256/£(n)-i(>) is a tree and ^256/e(n)-i(>) ^ U{'^256/£(n)(^^) : s < r} = 0. 
Then 



a, < ( (n - 1)/ ( 2 ) ) X (7r + 5r + Cr) 



where 



7r = pr(not Qr), 

(5r = pr (jr not forced in 256 /e{n) — 1 steps | Qr), 

(r = pT{ir not forced in 256/e{n) steps | Qr and jr is forced in 256/e{n) — 1 steps). 

We now get upper bounds on 7^, 6r, and Cr- 

First, >S'25gy^(„^_j^(jr') is not a tree only if there exist two paths of length at most 256/ 6{n) — 
1 beginning at some common gate and ending at jr- This probability is bounded above by 

256/e(n)-l 256/£(n)-l 

J2 nP{2/n)Pp^ J2 ^'"'(2/^)" < (256/£(n))^25i2/.(n)^-i 

p=0 q=l 

= o{e{n)) since l/s{n) <^ logn. 
The probability that S^^^^^^^^-^^^jr) H U{'5'256/e(n)(^«) : -s < r} 7^ is bounded above by 

256/e(n)-l 

^ ^ 2256A(-) X ^ nP-i(2/n)P = o(£(n)), 

so 

7^ = o{e{n)). 

Assuming Qr holds, the event that jr is not forced in 256 /e{n) — 1 steps is independent 
of the events that ig is forced in 256 /6{n) steps, s < r. Therefore by Lemma |2.5| , 

5r < 16£(n)/255. 

Finally, Cr = 1/2, so ar < + 32e(n)/255 + 0(5(71))). Therefore the probability that 

ii, . . . ,ii form an unforced path is at most 

{n-^lil - £{n)){l + 32£(n)/255 + o{e{n))) + £(n)/2]}'-^ < n-^+\l - £{n)/Ay-\ 

and the probability that there exists such ii, . . . ,ii is at most n{l — £:(n)/4)'^^. Since / = 
\4:\ogn/e{n)], this is asymptotic to 

□ 

Lemma 3.2 If e{n) ^ 1/logn then 

pr((L),6) has an unforced cycle larger than 8 loglogn/5(n)) = o(l). 
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Proof. By Lemma 3J., we need consider only cycles of length at most 4 \ogn/e{n). Summing 
over all cycle sizes from 8 log log n/e(n) to 4 log n/£:(n), and using the same estimates as in 
the proof of Lemma O], the probability is bounded above by 



4\ogn/e{n) x (1 - e(n)/4)Si°s'°snMn) 



< 



4(logn)2 X e-2i°§'°g" 



□ 

Lemma 3.3 If e{n) ^ 1/logn then 

pT{{D,b) has unforced cycles connected by an unforced path) = o(l). 



Proof. By Lemmas |3.1| and |3.2| , we need consider only cycles of size at most 8 loglog?T,/£:(n) 
and paths of length at most 41ogr;,/£:(n). The endpoints of the path can be chosen in at 
most (81oglogn/e(n))^ ways. Summing over all cycle sizes up to 8 log log n/£:(n), all possible 
choices of endpoints of the path, and all paths of length up to 4 log n/e(n), the probability 
is bounded above by 

(81oglogn/e(n))'^ x (41og?2/e(n)) x —* 0. 

□ 

Theorem 3.4 If e{n) ^ 1/logn then 

pr((D,6) has a tail longer than Qlogn/ e{n)) = o(l) 

Proof. After 256/ e{n) steps, the only gates that are not yet stable are those in unforced 
paths and cycles. We may assume that all the cycles and paths are disjoint except possibly 
at the endpoints of the paths. By Lemma p.3| , with probability 1 — o(l), no path begins and 
ends at a cycle (see Figure l.(a)). 

Let I be the length of the longest unforced path in {D,b) and m be the size of its largest 
unforced cycle. After I more steps, the only gates that are not yet stable are those in unforced 
cycles and paths beginning at an unforced cycle (see Figure l.(b)). 

Now consider the state of the gates in these cycles, i.e. the projection of the state of 
{D, b), where we look only at the values of the gates in the unforced cycles. This state will 
reach its state cycle in at most 2m steps. Then {D, b) will reach its state cycle in at most / 



more steps. The Theorem then follows from Lemmas |3.1| and |3]^. □ Typical structure of 
the unstable gates of (-D, b). 

(a) After 2b6/e{n) steps. 

(b) After I more steps, where / is the maximum length of unforced paths that do not begin 
at an unforced cycle. 

FIGURE 2 

Corollary 3.5 If s{n) ^ 1/logn then 

lim pr(all tail lengths of {D,b) are o((logn)^)) = 1. 
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Theorem 3.6 If e{n) ^ 1/logn then 

pr((Z),6) has a state cycle larger than 2i2i°gi°g V^W) = o(l). 

Proof. Any path (or cycle) of unstable gates is also an unforced path (or cycle). By Lemma 
|3.3| , no unforced cycle is connected by an unforced path to an unforced cycle. Therefore no 
cycle of unstable gates is connected by a path of unstable gates to a cycle of unstable gates. 
Then the size of any state cycle of {D, b) is at most the least common multiple of the periods 
of all the cycles of unstable gates. A cycle consisting of s unstable gates has period t or 2t 
for some factor t of s. Let m be the size of the largest cycle of unstable gates. Then the size 
of any state cycle of {D, b) is bounded by twice the least common multiple of all the natural 
numbers less than or equal to m. 

By the Prime Number Theorem (see [Q), the number of primes less than or equal to m is 
asymptotic to mloge/logm. Then the least common multiple of all the numbers less than 
or equal to m is bounded by ui^-^5^/^°sm^ g^^^ ^j^g gj^g of any state cycle of {D, b) is at most 

2^^-4:5771/ log m ^ log log m/e(n) 

by Lemma □ 

Corollary 3.7 Assume e{n) > a log log n/ log n for some constant a. Then there is a con- 
stant c such that 

pr((D, 6) has a tail longer than c(logn)^/ loglogn) = o(l), and 
pr((D,6) has a state cycle larger than n"^) = o(l). 

In particular, if a > 12, then we can take c < 1. 



4 Discussion 

There are a number of problems suggested by our results. An immediate question is whether 
small upper bounds on tail length and state cycle size can be proven for Kauffman's model. 
More generally, the effect of using other distributions on the boolean functions should be 
investigated. Possibly the uniform distribution, where each boolean function has probabil- 
ity 1/16, is a threshold between chaotic and antichaotic behavior. That is, changing the 
probability of a certain type of function from less than 1/16 to greater than 1/16 may al- 
ter drastically the stability of the network. It may be meaningful to group the functions 
into several categories, such as constant functions, functions depending on one argument, 
canalyzing functions depending on two arguments, and non-canalyzing functions. 

Consequences of using functions with more than two arguments should also be studied. 
Compared to the two argument functions, the three argument functions have a much smaller 
proportion of canalyzing functions, and there may be another threshold involving the inde- 
gree of the gates. A network with a significant number of three input gates is likely to have 
more non-canalyzing functions, and these would be less likely to be stable. 
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FIGURE 1 




FIGURE 2 



FIGURE CAPTIONS 



An example of a boolean cellular automaton without any forced gates, all of whose gates become stable. 
The labels in the circles denote the boolean functions assigned to the gates: ® is exclusive or, and -i is 
negation. 

FIGURE 1 



Typical structure of the unstable gates of (D, b). 

(a) After 256 /e{n) steps. 

(b) After I more steps, where / is the maximum length of unforced paths that do not begin at an unforced 
cycle. 

FIGURE 2 



